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Q . Abstract 

■ We develop a new framework that extends the quantum walk framework of Magniez, Nayak, 

Roland, and Santha, by utilizing the idea of quantum data structures to construct an efficient 
cj ■ method of nesting quantum walks. Surprisingly, only classical data structures were considered 

<*S | before for searching via quantum walks. 

The recently proposed learning graph framework of Belovs has yielded improved upper 
bounds for several problems, including triangle finding and more general subgraph detection. 
We exhibit the power of our framework by giving a simple explicit constructions that rcpro- 
q-( ducc both the <3(n 35 / 27 ) and 0(n 9 / 7 ) learning graph upper bounds (up to logarithmic factors) 

for triangle finding, and discuss how other known upper bounds in the original learning graph 
framework can be converted to algorithms in our framework. We hope that the ease of use of 
this framework will lead to the discovery of new upper bounds. 
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1 Introduction 



O" 1-1 Background 

CN . Shenvi, Kempe, and Whaley [SKW03] first pointed out the algorithmic potential of quantum walks 

by designing a walk based algorithm to emulate Grover search [Gro96]. The first algorithm using 
quantum walks that goes beyond the capability of Grover search is due to Ambainis [Amb04] 
for element distinctness. In his seminal paper he resolved the quantum query complexity of the 
problem, settling a difficult question that had been open for several years [BDH + 05, AS04]. Based 
on Ambainis' quantum walk, Szegedy [Sze04] developed a general theory of quantum walk based 
algorithms. He constructed the quantum analogue W(P) of any symmetric, ergodic Markov chain 
P, and designed a quantum search algorithm based on the use of W{P). Later, this was generalized 
to reversible, ergodic Markov chains P, and the construction simplified by the use of quantum phase 
estimation [MNRS11, MNRS12]. 

After a series of new quantum walk algorithms with substantially better complexity in a variety 
of contexts [BS06, NM07, MSS07, ACR+10, CK11, Gall2], a new framework was introduced by 
Belovs, known as learning graphs [Bell2b]. Learning graphs are a model of computation related to 
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span programs [Rei09]. Introduced by Reichardt in the model of quantum query algorithms, span 
programs have been used to characterize quantum query complexity in terms of the negative adver- 
sary method [Rei09, Reill, LMR + 11]. Whereas explicitly constructing a span program is difficult 
in practice, coming up with a learning graph is more easily accomplished. Indeed, Belovs [Bell2b] 
gave a learning graph whose query complexity improved the best known quantum query algorithms 
for triangle finding. Later on, other learning graphs were constructed for constant-size subgraph 
finding [Zhull, LMSll], and also for /c-element distinctness [BL11, Bell2a]. Recently, a simpler 
and more efficient learning graph was proposed for triangle finding, together with a new application 
to associativity testing [LMS13]. 

In order to get an explicit quantum algorithm, one has to transform a learning graph into a 
span program, and then to an algorithm. This transformation is generic but incorporates many 
technical difficulties. The resulting algorithm is quite complicated. For instance, the time and space 
complexities may not be optimal. For specific problems, such as st-connectivity [BR12], one can 
find explicit implementations of span programs. Still, one would like to have a framework which 
leads easily to the explicit algorithm and its complexity (not only query but also time and space, 
for instance), as is the case in the setting of search algorithms based on quantum walks. 

1.2 Quantum walk based search algorithms 

At an abstract level, many search problems may be cast as the problem of finding a "marked" 
element from a set f2 with n elements. Let M C Q be the set of marked elements. Assume that 
the search algorithms maintain a data structure T> that associates some data T>(u) with every state 
u£ O. From T>(u), we would like to determine if u G M. In this framework, Grover search, and 
more generally amplitude amplification, consists of the following procedure, where tt is some given 
probability distribution on 0. 

1. Set up a superposition |vr) = ^ u ^/W^\u) of elements h£(1 

Compute T>, that is \u) i— > \u)\T>(u)), for all elements u in the current superposition 

2. Iterate times, where e = Pr 7r (n G M). 

(a) Flip marked elements, that is \u)\T>(u)) i— > — \u)\T>(u)), if u G M 

(b) Uncompute T>, that is \u)\T)(u)) \— > \u) 
Apply a reflection through | tt) 

Compute V, that is |u) i-> \u)\V(u)), for all elements u in the current superposition 

3. Output u if it is marked, otherwise output 'no marked element' 

When operating with D, we distinguish two types of cost. 

Setup cost S: The cost of creating | vr) and of constructing the data structure T>(u) for the 
state u. 

Checking cost C: The cost of checking if u G M using V(u). 

These costs may be thought of as vectors listing all the measures of complexity of interest, such 
as query and time complexity. Then the above algorithm has cost of order -}|(S + C). This 
above approach is a priori only valid when the probability Pr 7r (n G M) is known in advance. 
This assumption may be removed using standard techniques, without increasing the asymptotic 
complexity of the algorithms [BBHT98]. Indeed, if only a lower bound e > on Pr 7r (u G M) is 
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known for non-empty M, then the above argument can be modified in order to determine if M is 
empty or find an element of M. 

A more cost-effective approach is to implement the reflection through \ir) using the quantum 
analogue W(P) of a reversible and ergodic walk P on a given graph G. In this way, we can ensure 
that u has only small changes at each step, permitting us to update its data structure T>(u) without 
uncomputing/computing it. We then introduce a third type of cost. 

Update cost U: The cost of simulating one application of W(P) according to the Markov 
chain P, and of updating T>{u) to T>{v) where (u, v) £ G. 

Then finding a marked element, if there is one, can be done with cost of order S + -^(-^=U + C), 
where 5 is the spectral gap of P [MNRS11]. 

1.3 Our contribution 

In this paper, we introduce a new model of quantum walks which can reproduce the best known 
learning graph upper bounds for triangle finding and related problems (up to log factors). Our 
framework is also simple and leads to explicit algorithms that are much simpler to analyze than 
learning graphs. 

To achieve our goal, we introduce the notion of quantum data structures for nested quantum 
walks. Surprisingly, only classical data structures were considered before for searching via quantum 
walk. We demonstrate the potential of this framework by giving explicit and simple quantum algo- 
rithms for triangle finding, achieving the same query complexity as their learning graph analogues 
up to polylogarithmic factors (Section 3.4) and more generally, reproducing all subgraph contain- 
ment upper bounds of [LMS13] up to polylogarithmic factors, and potentially improving on these 
upper bounds for certain graphs (Section 4.2). 

Concretely, assume that we introduce another quantum walk based search as a checking sub- 
routine. Let Pi and P2 be, respectively, the outer and inner walks, with respective costs Si, Ui, Ci 
and S2, U2, C2, and parameters e±, S± and £2, 62- Since the checking cost of the outer walk can be 
expressed as Q = 0(S2 + -^(—^-l^ + C2)) the global of cost of finding a marked element is of 

V ^2 V 02 

order (neglecting log factors) 

Sl+ v^i \VSi Ul + Sl + v^ 2 (\/i*2 U2 + C2 )) ' 

We will show that allowing a quantum data structure for the outer walk, one can push the setup 
cost S2 of the inner walk to the outer one. In order to accomplish this, we include in the data 
structure T>(u), for each state u of the outer walk, the initial state for the inner quantum walk. It is 
then necessary that the initial state for the inner walk be properly preserved by the outer update, 
as well as the inner walk, so that we needn't perform the inner setup multiple times. In equations, 
this leads to an improved cost of order (Theorem 3.2): 

S ' + S2 + ^(v^ U;+ vk(^ U2 + C2 ))' 

where some logarithmic factors are now omitted. The new update cost U'i is the cost for updating 
the two data structures when one step of the outer walk is performed. In our application, U'i will 
be of the same order as Ui. Note that the update of the inner walk may be performed much more 
often than the update of the outer walk. Some of the power of our nested framework comes from 
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keeping expensive operations in the outer update, and pushing only cheaper operations into the 
inner update. 

Our very simple idea seems for now as powerful as most learning graph based algorithms [Bell2b, 
BL11, Zhull, LMS11, LMS13], except the recent one for fc-element distinctness [Bell2a], which 
deviates significantly from the original learning graph framework. Nonetheless, we conjecture that 
one might use our framework, possibly with further extensions, to design a similar algorithm. 

The idea is a posteriori very natural. Let us make an analogy with the adversary method. The 
method was first designed with positive weights by Ambainis [AmbOO]. Only several years later, 
the possibility to put negative weight was proven to be not only possible but useful. Later, using 
the dual of this method, span programs were proven to achieve optimal quantum query complexity. 
In a more modest direction, we hope that our contribution will lead to a better understanding of 
the power of quantum walks and learning graphs. 

While quantum walks are a powerful computational primitive, our main motivation for extend- 
ing the quantum walk framework to include recent learning graph algorithms was not to demonstrate 
that fact. We believe that the framework will be useful for designing new quantum algorithms, 
since it allows the use of intuition gained from experience with quantum and random walks. Some 
ideas, like function composition and subroutines, are very natural in the quantum walk framework, 
but not as straightforward in the learning graph framework. We hope that the different perspec- 
tives provided by the quantum walk framework and the learning graph framework will lead to the 
discovery of new quantum algorithms. 

2 Preliminaries 

2.1 Model and notation 

Our framework applies to other notions of complexity such as time and space. Nonetheless we will 
only illustrate it within the model of query complexity. In the model of quantum query complexity, 
the input string x is accessed by querying a black box to learn the bits of x. More concretely, the 
black box performs the following unitary operation: O x : \j,b) \— > \ j,b (B xj). The quantum query 
complexity of a quantum algorithm is the number of times the algorithm uses the operator O x . 
In this paper, x will always refer to the input string. For a more thorough introduction to the 
quantum query model, see [BBC + 01]. 

For problems where the input is a graph, such as the triangle finding problem, x will be the 
adjacency matrix of an undirected graph. The triangle finding problem asks if the input graph 
contains a triangle, i.e., three pairwise adjacent vertices. A vertex is called a triangle vertex if it is 
part of a triangle. Similarly, an edge is called a triangle edge if it is part of a triangle. The best 
known triangle finding algorithms solve, as a subroutine, the graph collision problem. In the graph 
collision problem parametrized by a graph G on n vertices, the input is a binary string of length 
n that indicates whether a vertex in G is marked or not. The objective is to determine whether 
there exist two adjacent vertices in G that are both marked. Note that the graph G is part of the 
problem specification, and not part of the input. 

We will use the O notation to suppress logarithmic factors. More precisely, /(ft) = 0(g(n)) 
means /(ft) = 0(g(n) log fc ft) for some integer k. 

Given any algorithm computing a function f(x) with probability at least 2/3, we can repeat 
this algorithm O (log (-)) times and take the majority vote of outcomes to boost the algorithm's 
success probability to 1 — e. Since all the subroutines in this paper are used at most poly(n) 
times, if we boost a bounded error subroutine's success probability to 1 — l/poly(n), then the final 
algorithm will still be correct with bounded error. This adds one log factor to the complexity of 
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the subroutine; this is the source of the log factors in this paper. We will also use this technique 
to boost the success probability of quantum algorithms that do not output a bit, but instead flip 
the phase when f(x) is 1. By standard techniques, such an algorithm can be converted to one 
that computes f{x) in a separate register. Now we can repeat this algorithm logarithmically many 
times, flip the phase conditioned on the majority vote of outcomes and uncompute the answers to 
obtain an algorithm whose success probability has been boosted to 1 — l/poly(n). 

The Johnson graph J(n, r) is a graph on f n ) vertices, where each vertex is an r-subset of 
{1, 2, . . . , n}, and two vertices are adjacent if their symmetric difference contains exactly two ele- 
ments. The spectral gap of J(n, r) is Q(l/r). 

Let G = (V, E) be a graph, and let R, S C V and F a set of edges on V, not necessarily 
contained in E. We denote by Gr the subgraph of G induced by R, and by Gr^ the bipartite 
subgraph of G induced by classes R and S. That is, Gr^s has vertex set R U S and edges set 
R x S n E. Denote by G{F) the subgraph of G obtained by restricting to edges in F. That is, 
G(F) has vertex set V and edges set E n F. 

2.2 Review of the quantum walk framework 

We now review the quantum walk search framework of Magniez, Nayak, Roland and San- 
tha [MNRS11]. We present a slight modification of the framework, which is tailored to our appli- 
cation. We discuss the minor differences between our presentation and the original framework at 
the end of this section. 

Let P be a reversible Markov chain on (finite) state space f2, with stationary distribution tt. 
Let V x : O — > %d, for some Hilbert space Tip, be a mapping that associates data with a state of 
the Markov chain for input x (i.e., for u £ Q, \T> x (u)) is the data associated with state u for input 
x). We will refer to {\T> x (u))} U £Q as the data structure associated with the Markov chain. 

In all previous applications of the quantum walk search framework, the data structure has 
always been classical, i.e., \T> x {u)) is a classical basis state for all u G 0, however, this needn't be 
the case in general. In our framework, we make use of quantum data structures. 

Setup Define Up = Up = Hn := span{{|0)} U {\u) : u G ft}}. Define the initial state | vr^) as 
\k x ) '■= Suen V^u\ u ) r\T> x (u)) d and let S = S(P,V X ) be the cost of constructing \n x ). We call 
S the setup cost. 

Update Define two unitary operators Up and IAd with the following action for all ufSl: 



In [MNRS11], these two operations are defined as one, however, we find it beneficial to consider 
them separately, since they play different roles: Up updates the walk according to the Markov chain 
P, while Up> updates the data structure. Since the Markov chain is independent of the input in all 
of our applications, the query complexity of implementing Up is 0, although its time complexity is 
non-zero. 

Let U = U(P, V x ) denote the cost of implementing Up,Up. We refer to U as the update cost. 

Checking Fix some set M x C fiof marked elements. Let C = C(M X ) be the cost of implementing 
the unitary that acts as 




and Up, : \u) L \v) R \V x (u)) H> \u) L \v) R \V x {v)). 
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We refer to C as the checking cost. With these definitions, we are ready to state the main theorem 
of [MNRS11]. 



Theorem 2.1. Let 5 > be the eigenvalue gap of P. Let e be such that for any x, if M x is 
non-empty, then ir(M x ) > e. Then there is a quantum algorithm with cost 



that, when given the initial state |vr x ), determines with bounded error whether M x is non-empty. 
More precisely, the algorithm implements (with bounded error) the map 



We will use of the above theorem, which is proven, though not explicitly stated in these terms, 
in [MNRS11]. The explicit theorem of [MNRS11] follows directly from this theorem, and is stated 
in Corollary 2.2. 

Corollary 2.2. There is a quantum algorithm with cost 



that determines with bounded error whether M x is non-empty. 

This presentation differs from the original presentation in some aspects. First, we allow the 
data structure to be quantum, while the original presentation implicitly assumed it to be classical. 
This, however, does not affect the proof. Second, the original presentation had a data structure 
associated with both registers of the walk, whereas we have only one data structure. As mentioned 
in [MNRS11], this is only for convenience. Third, we explicitly state the cost of the algorithm as 
a sum of two costs, the setup cost and the cost of running the walk. This is consistent with the 
original algorithm. 

3 Two- level nested quantum walks 

In this section we show how to augment the above framework with nested quantum walks using 
a quantum data structure. As motivation for this framework, it will be useful to compare the 
quantum walk algorithm of Magniez, Santha and Szegedy for triangle finding [MSS07], which has 
query complexity 0(n 3 ), with the learning graph algorithm of Belovs [Bell2b], which has query 
complexity 0(n 35//27 ) = 0(n 1,297 ). In Section 3.2, we formally state the general construction for 
the case of two-level nested quantum walks. In Section 4.1, we explain how to generalize to k levels 
of nesting. 

3.1 Warming up 

Quantum walk algorithm for triangle finding In the quantum walk algorithm of [MSS07], 
a quantum walk is used to search for a pair of triangle vertices. The walk is on the Johnson graph 
of sets of r vertices, J(n, r), which has spectral gap 5 = $7 (=J. A state is marked if it contains two 






vertices of a triangle, so the proportion of marked states is lower bounded by e = 0, 




6 



If G denotes the input graph, the data associated with a state R of the Johnson graph is 
\T>g(R)) = \Gr), which encodes the subgraph of G induced by the vertices in R. The setup cost 
is S = 0(r 2 ), since, for each state, we must initially query all (0 possible edges in the subgraph 
induced by R. The update cost is U = O(r), since, at each step of the walk, we remove a vertex i 
and add a new vertex i' to the set of vertices, and must therefore test the adjacency of i' with all 
remaining vertices in the set. 

Finally, to check whether we have two triangle vertices in a state R, we search for a vertex k 
that forms a triangle with two of the vertices in R. To check if a fixed vertex k forms a triangle 
with any two of the vertices in R, we solve the graph collision problem on Gr, which is encoded in 
T>g{R), with vertices in R defined as being marked if they are adjacent to k. Thus, we can check 
if a vertex k forms a triangle with two vertices in R using queries, and so we can check if 

any vertex forms a triangle with two vertices in R using C = 0(y/nr 2 ^) queries. 

Plugging all of these values into the formula from Corollary 2.2 gives (neglecting logarithmic 
factors) 

S + M^U + C) = ^ + ~r + ^ 2/3 ) 

which is optimized by setting r = n 3 / 5 , yielding query complexity 0(n L3 ) for triangle finding. 

Learning graph algorithm for triangle finding In contrast, the recent learning graph al- 
gorithm of Belovs [Bell2b] for triangle finding considers sets of r vertices, but only queries a 
sparsification of the possible edges between them. If the sparsification parameter is s € (0, 1] , then 
each potential edge is queried with probability s, giving an expected s^) queries for each state. 

We can try to apply this edge sparsification idea to the above quantum walk. We will redefine 
the data structure to be a random sparsification of the potential edges in Gr, so that the setup 
cost decreases to S = 0{sr 2 ) and similarly the update cost decreases to U = O(sr) (let us assume 
that the setup and update operations can be performed with their average cost, instead of their 
worst-case cost). However, now some states that contain two triangle vertices will not contain 
the edge between them, and will therefore not be detected by the checking procedure. Thus, the 

2 

probability that a state is marked has also decreased to e = Q(s^z). If we plug in these new values, 
we get query complexity 

s + M^u + c) = sr2 + * ^ sr + ^ r2/3 ) 

9 i 

= sr* + ny/rs + ^jw^, 

which is optimized by s = 1 and r = n 3 / 5 , yielding no improvement over the standard quantum 
walk algorithm. 

One key idea on which our framework is based, is to observe that a state R that contains two 
triangle vertices should be considered marked, even if its data structure is missing the edge between 
them. Of course, we need that edge to know that the state is marked, but we can check this by 
searching for a good data structure, i.e., one containing the triangle edge. To this end, we allow 
the data structure of a state R to be a quantum superposition over a number of classical data 
structures, some of which may contain some kind of certificate for R, if R is in fact marked. We 
walk on this data structure as a checking subroutine. Importantly, the setup of this inner walk, 
that is, the creation of an appropriate superposition of data structures, has already been taken care 
of as part of the setup of the outer walk. It is very important that we only perform this setup once, 
as opposed to at every checking step. 
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In Section 3.4, we use our framework to effectively apply the sparsification idea to triangle 
finding in a nested quantum walk. We first present the general framework for nesting quantum 
walks with a quantum data structure. We start with nesting one walk within another. 

3.2 Framework for nested quantum walks 

Let P be a reversible Markov chain on (finite) state space fi, with stationary distribution ir. For 
each input x, fix a marked set M x C VL. Let D x : £1 — > %y be some associated data mapping. The 
Markov chain P defines what we refer to as the outer walk. 

Fix «£(]. We will define what we refer to as the inner walk, whose purpose is to check whether 
u is in M x . Let P 1 be a reversible Markov chain on state space Q! with stationary distribution 7r'. 
Let M x C Q! be the marked set for the inner walk, with respect to u. Let T> x : O' — >■ T-Ly be the 
inner data mapping with respect to u. Note that, although we have the walk itself, defined by P' , 
Q' and tt' , independent of u, the marked set and data mapping are dependent on u. 

We suppose the following relationship between the inner and outer marked sets: 

M x = {u € Q : Ml ^ 0}. 

We can then view the set M x as a set of witnesses for u G M x . 

In order to make use of the quantum data structure, we suppose the following relationship holds 
for the inner and outer data structures. First, we require T-Ly = T~tn' ®~Hd'i an d in particular, we 
need 

\V x {u)) = Y J V<\s)\V u x {s)). 

sen 1 

That is, \D x (u)) is the initial state of the inner walk with respect to u. In general, the data structure 
for the outer walk could store more information than just the initial state of the inner walk, but 
this is all we need for our applications. 

Setup Define spaces U L = Hr = Hn and Uy = Ur> = %n>- The setup cost, S = S(P,V X ), is 
the cost of constructing the initial state: 

K") = V^\u)l\0)r\T> x (u)) d \0) r , 

= V^\u)l\0)r VK\s)y\V^s)) D/ \0) Rl . 

We emphasize that the initial state for the outer walk contains, in superposition, initial states for 
the inner walks with respect to each 

Update The update cost, U = U(P, V x ), is the cost of implementing UyUp. Recall that Up is 
defined by the action Up : \u)l\0)r \-t \u)l J2v&n \/ Pu,v\v) r- Similarly, Uy is defined by the action 

Uy : \u) L \v) R \V x (u)) D (->■ \u) l \v)r\V x (v)) d . 

This mapping is equivalently stated as: 

Uy : \u) L \v) R V^s\s)v\V u x {s))y ^ \u) L \v) R ^ ^ s \s) y\V v x {s)) D , , 
sen' sen 1 



8 



which can be implemented by an operation with the action: 

\u) L \v) R \V u x {s)) D , ^ \u) L \v) R \Vl{s)) Dl . 

The most important point to emphasize about this outer update is that we must update from the 
inner initial state with respect to some outer state u (contained in \D x (u))), to the inner initial 
state with respect to some other outer state v (contained in \V x (y))). 



Checking To check whether a state u is in M x , we want to implement the reflection 

C : \u)\V x {u)) ^ 



-\u)\V x {u)) ifueAf, 
\u)\V x (u)) else 



which, in this case, is: 



S ^ V \ M & s ev VK\s)\K(s))) else 

This is simply a reflection about the initial state of the inner walk with respect to u. Importantly, 
this preserves the inner initial state for future use, so we needn't set it up again. We accomplish 
this reflection by way of Theorem 2.1 applied to the inner walk. Let U' = \J'(P' ,T> X ) be an upper 
bound (over all inputs x and states u) on the update cost of the inner walk. That is, U' is the cost 
of implementing 

U'p : \s) L ,\0) R , ^ \a) v \fPL\t)R' and U 'd ■ \s) L>\t) r>\VI{s)) D , k> \s) L ,\t) R ,\V%(t)) D , . 
ten 1 

Let C = C'(M X ) be the checking cost of the inner walk, that is, the cost of implementing the 
reflection 

-\s)\v-(s)) ifseM- 

\s)\V2(s)) else 



i*>ra«)>- 

Let 5' denote the spectral gap of P' and e' denote a lower bound on tt'(M x ) for non-empty M x . 



Lemma 3.1. Let C be the cost of implementing C with bounded error. Then C 

(tf(w u ' + c ' 

Proof. This follows directly from Theorem 2.1, since \T> x (u)) is the initial state of the inner walk 
(with respect to u). □ 

Theorem 3.2. We can decide whether M x is non-empty, with bounded error, in quantum query 
complexity 

Proof. We can reduce the error in C from bounded error to error that is inverse polynomial at the 
cost of an additional log factor, as mentioned in Section 2. Since we only run for a polynomial 
number of steps, the result follows. □ 

This construction can be applied inductively to achieve nesting to any depth k, though the 
number of terms in the complexity will be proportional to k. Though the generalization to k levels 
is fairly straightforward, we give the explicit construction in Section 4.1. 
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3.3 Averaging costs 

In this section we prove a lemma about averaging costs of operations that will be useful in some of 
our nested walk applications. In particular, we sometimes have update costs which are much lower 
on average than in the worst case, and we would like to incur the average, rather than worst case 
update cost. 

Let {Ui} be a set of unitaries acting on the same space. If we know how to exactly implement 
each U{ with c% queries, then we can exactly implement the controlled unitary U = <8> Ui 

with maxj Cj queries. However, if the state we are implementing this unitary on has a nearly 
uniform distribution over the control register, then intuitively it seems like we should be able to 
approximately implement the unitary with a cost that goes like the average instead of the maximum. 
The following lemma makes this precise. 

Lemma 3.3. Let qi be the query complexity of exactly implementing unitary Ui. Let U be the 
controlled unitary U = ^ ® ^i- F° r anv prescribed upper bound q, we can convert the state 
\s) = Y2i cti\i)\si) to a state \t) using at most q queries, such that \t) has inner product at least 1 — e q 
with U\s) = oti\i)Ui\si) , where e q < Yli- qi > q \ a i\ 2 ■ Ln particular, with 0(^2 i \ai\ 2 qi) queries, we 
can achieve any constant e. 

Proof. The algorithm for implementing this conversion is simple: We only implement those matrices 
Ui whose cost is less than the prescribed upper bound. Otherwise we implement, for example, the 
identity matrix. More precisely, we implement the unitary U = Yli-q <q K)(*l®^»+Ei-gi>g 
Clearly this unitary costs at most q queries to implement. 

The inner product between U\s) and U\s) is Yli- qi < q \ a i\ 2 ( s i\ s i) + Yli-. qi > q \ a i\ 2 i s i\U\si) , which 
is at least T,i-. qi < q N 2 - Th us e q < V, :i/ .„ H 2 . 

To show the final portion of the theorem, consider the random variable that takes value qi with 
probability \ai\ 2 . By Markov's inequality, the probability that this random variable takes a value 
greater than k times the mean is less than 1/k. So, for any k > 1, if we set q = k(^2i \ a i\ 2 Qi)> we 
can achieve e < 1/k. □ 

Note that Markov's inequality uses very minimal information about the structure of the random 
variable. If the random variable has more structure, such as being tightly concentrated around the 
mean, then we can get much better estimates. The lemma is simple and constructive, and thus 
it can also be made efficient in terms of time and space if the approximate unitary U can be 
implemented efficiently. 

3.4 Two applications to triangle finding 

We can now present our first two nested quantum walk algorithms, both for the triangle finding 
problem. We first focus on the the learning graph of Belovs [Bell2b] since it is more challenging 
to implement due to the sparsification idea. Later we will sketch the quantum walk algorithm 
corresponding to the more recent and more efficient learning graph in [LMS13]. 

Theorem 3.4. There is a nested quantum walk algorithm for triangle finding with quantum query 
complexity O^n^f). 

Proof. Let the input graph be called G. The algorithm consists of two quantum walks, which we 
will call the inner and outer walk. 
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Outer walk The outer walk is a quantum walk on J (re, r), just as in the previous triangle finding 
algorithm, where 1 < r < n is a parameter to be chosen later. Each state of the Johnson graph 
corresponds to an r-subset of the vertex set of G. A state of the Johnson graph, R, is said to be 
marked if it contains two triangle vertices. This fully describes the Markov chain of the outer walk; 
we now have to describe the data structure associated with each state and the checking subroutine, 
which is the inner walk. As described in Section 3.2, the data structure associated with each state 
of the outer walk will be the initial state of the inner walk. 



Inner walk The inner walk is on J ( (2) ' s (2) ) ' wnere 1/r < s < 1 is & parameter to be chosen 
later. A state, T, of the inner walk is an s^-sized subset of the potential edges of an r-vertex 
graph. The data structure associated with a state T with outer state R, T> R (T), is the subgraph 
Gr{T). A state T is said to be marked if Gr(T) contains a triangle edge. The checking subroutine 
for this walk is the same as the checking step in the quantum walk described in Section 3: we search 
for a vertex outside of R that forms a triangle with one of the edges in Gr(T). 

Outer walk data structure We now return to describing the data structure of the outer walk. 
The quantum data structure corresponding to any state R of the outer walk will be the initial state 
of the inner quantum walk, i.e., the uniform superposition over all states of the inner walk. For the 

inner walk, the initial state is |7r^) = — \T\\G R (T)\ t where O2 is the state space of the 



inner walk, and IQ5 



(2) 




Inner walk cost analysis We can now compute the costs of the various operations in the inner 
walk. In the inner walk, the checking step is the same as the one described in Section 3.1 and thus 
costs C2 = 0(y/nr 2 / 3 ). The update operation just deletes and adds a new edge, giving U2 = 0(1)- 
The spectral gap of the Johnson walk is 62 = 0(l/sr 2 ) and the fraction of marked states, assuming 
the outer state is marked, is £2 > s. Thus the cost of performing the inner walk, given the inital 
state for the walk, which is the checking cost for the outer walk, is 

r ~ ( W U 2 




^2/3 



Since the checking step with cost C2 is only bounded error, we need to amplify the success proba- 
bility, which leads to the log factor subsumed in the O notation. 

Outer walk cost analysis Now we can move to the outer walk. The superposition corresponding 
to a state in the outer walk is 

\R)\^) = \R) £ -^\T)\G R (T)). 

TeQ'2 V l iZ2 l 

Setting up this superposition clearly costs Si = sr 2 queries. A state is considered marked if it 
contains both vertices of a triangle edge. Thus £1 = fi ^(^) 2 ^- The spectral gap of this walk is 

61 = n(i/r). 
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Finally, we have to compute the update cost. The update cost is the cost of performing the 
map 

1 



to 



Ten 2 



\T)\G RI {T)). 




Since R and R' are adjacent states in a Johnson graph, R has one vertex not in R' , and R' has one 
vertex not in R. Let us call these vertices a and a' respectively. The cost of updating \Gr(T)) to 
\Gri(T)) is therefore deg(a) + deg(a'). In the worst case the degree of a vertex is r. However, on 
average any vertex has degree sr. Now we can invoke Lemma 3.3. For a given query upper bound 
q, e q is the probability that a randomly selected vertex will have degree greater than q. If we set 
q = 7(sr) + 100 log n, then we need to compute the probability that the degree of a vertex is more 
than 7 times its mean (the extra term is to deal with the case when sr is very small) . The degree of 
a vertex is a random variable distributed according to the hypergeometric distribution, for which 
strong tail bounds are known. By Theorem 2.10 and eq. (2.11) of [JLR11], e q < exp(— q) for any 
q at least 7 times the mean. In our case, this gives e q < exp(— 7sr)^ OT , which is sufficient for our 
purposes. 

Thus the final query complexity is 

0fs 1 + 4=(^L + Cl 



7.3/2 



O i^sr 2 + - ^sr y/r + r + y^r 2/3 ^ \ 
O sr 2 + ns\/r + n + 



which is optimized by choosing r = n 2 / 3 and s = n 1 / 27 , giving a final query complexity of 
0(n 35 / 27 ). □ 

We note that the optimal values of r and s are the same as those obtained in the learning 
graph upper bound. In fact, if we add some unnecessary layers of nesting by breaking up the outer 
walk into several nested walks, we can obtained the learning graph complexity term for term. This 
suggests a natural correspondence between these two frameworks. 

To emphasize a key ingredient in our framework, we note that the inner update is much cheaper 
than the outer update, since the inner update involves changing a single edge, whereas the outer 
update involves changing all edges adjacent to a single vertex. This is ideal, since the inner update 
operation is performed much more frequently than the outer update. 

A very recent unpublished result of Lee, Magniez, and Santha [LMS13] constructs a learning 
graph for triangle finding with query complexity 0(n 9 / 7 ) = 0(n L286 ). We can easily convert their 
learning graph to a very simple nested walk. Due to its simplicity, it is perhaps less illustrative 
of the power of the framework than the original learning graph for triangle finding, however, it 
illustrates the cost savings that can be obtained by performing expensive updates less frequently 
(by keeping them in the outer walk). 

Theorem 3.5. There is a nested quantum walk algorithm for triangle finding with query complexity 
0(n 9 / 7 ). 
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Proof. We will make use of the fact that collision finding on an unbalanced bipartite graph on classes 
of size T\ and r<i with r\ < T2 < r\ has quantum query complexity 0({r\r2) 1 ^) (see Appendix A). 

Outer walk The outer walk is on J(ra,ri), for 1 < r\ < n. A state R\ is marked if it contains a 
triangle vertex. Thus, we have 5\ = ^(^-) and E\ = 

Inner walk The inner walk is on J(n, r2), for 1 < < n. Suppose the outer state, R\, is marked, 
so it contains the vertex from some triangle. A state i?2 is marked if it contains a vertex from the 
same triangle. 

The data structure associated with a state R2 with outer state R\, T> Rl {R2), is the subgraph of 
G containing exactly those edges with one endpoint in R±, and one endpoint in R2, G Rlt R 2 . Thus, 
8\ = and £2 = ^(^)- To update, we must query r\ potential edges from a new vertex to 

R\, so U2 = r\. To check if a state is marked, we search for a third triangle vertex, using a graph 
collision subroutine, with total cost C2 = i/n^i^) 1 / 3 . 



Outer walk data structure The quantum data structure corresponding to a state R\ of the 
outer walk is ^2 Ri } \R%)\ Gr 1: r 2 )- The cost of learning G Ri ^r 2 is Si = r\T2, and the cost of 

updating G Rl)R2 to G R i R2 is Ui = r2, since we must query T2 potential edges between the new 
vertex and all of i?2- 

The final query complexity is (neglecting log factors) 

Si + -U f-^Ui + -U ( ^u 2 + c 2 



'2 



n 3/2 

rir 2 + y/nr 2 + n*Jr{ + 1/6 1/6 ■ 



Setting ri = n 4 / 7 and r2 = n 5 / 7 gives the claimed upper bound. Note that the optimal values have 
T2 larger than r±, making the inner update cheaper than the outer update. The inner update is 
run more often, not only because > -^=, but more fundamentally, because the term -^=^2 has 

a factor of — . whereas the term — i=Ui only has a factor of —k=. □ 



For this walk, in some sense, the inner marked set is independent of the outer state. To see this, 
consider the case where there is a single triangle, and note that this is the edge case from which we 
calculate a lower bound on the probability that a state is marked. In the case of a single triangle, 
we can imagine the inner and outer walks being on independent sets of vertices, R± and R2, each 
of which is marked if it contains a triangle vertex. (Of course, they should not contain the same 
triangle vertex.) In some sense then, this algorithm does not use the full power of the model. 

4 Multi-level nested quantum walks 
4.1 Generalization of the framework 

The simple two-level nesting from Section 3 can be generalized in a straightforward way to k levels 
of nesting. 
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Consider k walks {(P^ fi'*', 7rW)}* =1 . Suppose the i th walk has marked sets 

{ACIweoMx-.-xnC*- 1 ) 

(with M x implicitly being the unique marked set for the outermost walk) and data mappings 

: fi W -)• H £) (i)} u6n (i) x ... xn (i-i). 

Suppose that for alH = 1, . . . , k and all u G f^ 1 ) x • • • x , we have 

M£ = {ve fiW : Mj u ^ / 0}, 

and for i = 1, . . . , A; — 1, u G Sl^ 1 ) x • • • x f^ 1-1 ) and v G fiW, we have = M^fi+i) <g> H D ( i+1 ), 

and 



Mjesj('+ 1 ) 

Let Si denote the spectral gap of PW, and £j be a lower bound for non- empty ir®(M%) for all 
u G r^ 1 ) x • • • x f^ 4 " 1 ). Let S denote the cost of constructing 



^ V^>)i^(^)>. 

Let LI, be the update cost of the i th walk. Let C be the cost of implementing, for w G fi(% and 
it G x ••• x fi^" 1 ), 



|u>)|X>"(?i;)) else 



Corollary 4.1. We can decide whether M x is non-empty with bounded error in 

k I i Ik 



queries. 



This is not the most general nested walk. Rather than have a sing le i th walk (pW 5 nW , 7rW), we 
could have a family of i th walks, 0", vr M )} uen( i) x ... xQ (^i) , where ftW = UueaWx-xQ^- 1 ) 

However, such a general framework sacrifices the simplicity that is a feature of nested quantum 
walks. 



4.2 Application to subgraph containment 

It is simple to extend our quantum walk algorithms for triangle finding to obtain the more general 
learning graph upper bounds for subgraph containment [LMS11, Zhull, LMS13]. We now give an 
explicit construction of nested quantum walks corresponding to the learning graphs of [LMS13], 
matching their complexity up to polylogarithmic factors, and even potentially improving their result 
for some cases. For simplicity consider the case of undirected graphs with no self-loops. Let H be 
a graph on vertices {1,2, ... ,k}. Then a graph G contains a copy of H if G induces a graph that 
is isomorphic to H on some subset of k vertices. 
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The learning graph construction In [LMS13], a meta-language is introduced for designing a 
learning graph that detects a copy of H in G. Instructions are Setup, LoadVertex(i) for i 6 [k] and 
LoadEdge (i, j) for € E{H). A 'program' is an ordering of {Setup} U {LoadVertex(i)} i6 [ fc ] U 

{LoadEdge(z, j)}u j}£E(H) such that Setup appears first and for each edge {i,j} of H, instruction 
LoadEdge(i, j) occurs in the program after instructions LoadVertex(z) and LoadVertex(j) . 

Parameters are vertex-set sizes rj > 1 for each vertex i of H, and average degrees dij > 1 for each 
edge {i,j} of H. These parameters are not related to H but parameterize the resulting learning 
graph. Assuming some natural conditions on these parameters, one can derive an algorithm whose 
complexity only depends on them. Optimizing them can be done via a linear program (see [LMS13] 
for a link to the program). We assume that rj < n, dij < max(rj,rj), and that: 

For all i there exists j such that {i,j} £ E(H) and dij(2rj + l)/(2rj + 1) > 1. (2) 

Each instruction has a cost that propagates to later instructions, and its total effective cost. 
The first one is called global cost and the second one local cost. In [LMS13], a table is given, nearly 
identical to Table 1, providing global and local costs for each of the three types of instructions. 
Given a program of r = k + \E(H)\ + 1 instructions, if It denotes the local cost of instruction t, 
and gt the global cost, then the resulting upper bound for deciding if H is in G is 

Efn^V (3) 

4=1 \t'<t / 

The corresponding nested walk Given a program, we can define a sequence of nested walks as 
follows. The Setup instruction corresponds to our quantum walk setup. Each non-setup instruction 
corresponds to a quantum walk, with the walk corresponding to the i th instruction being nested 
at depth t; that is, the t th walk is the checking procedure for the (t — l) th walk. Each non-setup 
instruction corresponds to an edge or vertex in H, and so each of our walks will as well. 

All walks will be on Johnson graphs. The state of the walk corresponding to a vertex i is 
described by a subset of vertices Ri of size rj, and the state of the walk corresponding to an edge 
{i,j} is described by a subset Tjj of d^ min(rj,rj) potential edges between Ri and Rj. For each 
edge {i, j} of H, our database maintains the edges G(Tij) of G in Tjj. 

A state Ri is marked if there exists some copy of H in G such that: for each Rj in a walk at 
depth less than the depth of Ri, Rj contains the j th vertex in the subgraph, and for each T uv at 
depth less than the depth of Ri, T uv contains edge {u, v} in the subgraph, and finally, Ri contains 
the i th vertex in the subgraph. A state Tij has a similar condition for being marked, except Tij 
must contain edge {i,j} in the subgraph. 

Setup To setup, we must create the uniform superposition \ir) of subgraphs G(Tij), one for each 
edge {i, j} £ E(H). The total cost is 

S= xrim(ri,rj)dij. 
{ij}6H 

Vertex walks To update a walk on vertex set Ri we remove a vertex and add a new one. To 
update the data, we must update the graphs G(Tij), for each j adjacent to i in H. By Lemma 3.3, 
this complexity is related to the average degree in Ri, which we can calculate as dij min(l, rj/rj). 
The total cost for this type of update is then 

U= ^2 dij min(l,r j/ri). 
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Instruction 


Global Cost 


Local Cost 


Setup 


1 


S= xmn{ri,rj}dij 


LoadVertex(i) 




-±=U = V™ X dijnun(l,rj/ri) 


LoadEdge(i, j) 


= v niax{r,. } d , j 





Table 1: Global and local costs for quantum walks corresponding to each type of instruction. 

Observe that this quantity is always 0(1), by the imposed condition in (2). 

We can easily calculate that the proportion of marked states in this type of walk is e = ri/n, 
and the spectral gap is 5 = l/r%. 

Edge walks Lastly, to update a walk on edge set Tjj we simply replace one edge in and 
update G(Tij). Thus the cost of this type of update is simply U = 1. For this type of walk we 
have e = dij/ max(rj,rj) (since e is the probability that a particular edge of Ri x Rj is in Ty) and 
5 = 1/ min(rj, rj). 

Checking A state of the deepest walk is marked if the data encodes an entire copy of H in G. 
Since we can detect this from the data with no further queries to G, the final checking cost is C = 0. 

Global and local costs We can rephrase our costs for each walk into global and local costs. 
The global cost is simply the part of the cost that propagates forward to deeper walk costs. For 
the setup (which we can consider as a walk on a complete graph in which every state is marked), 
this is simply 1, whereas for other walks, the global cost is (see (1)). Similarly, we can consider 

local costs. For the setup this is defined as S, whereas for each other walk, it is defined as -7=U. 

V £0 

We list all global and local costs in Table 1. 

Comparing (1) and (3), we see that the upper bounds we achieve for subgraph detection are the 
same as those of [LMS13] (up to polylogarithmic factors), as a function of the global and local costs. 
Furthermore, our expressions for global and local costs are almost identical to those of [LMS13]. 
The only exception is that, depending on some sparsity condition, namely whether dij mm(n,rj) < 
max(r,j, rj), the local cost of LoadEdge(i, j) has two different expressions in [LMS13]: either ^Wj 
(when the condition holds) or max(rj,rj) (otherwise). Therefore, ignoring polylogarithmic factors, 
our total complexity is the same as that of [LMS13] or potentially better in some cases, because 
of our potentially better edge-local cost, though we know of no particular graph for which we have 
asymptotically better complexity. We remark that we believe that this improvement in edge-local 
cost could also be made in the learning graph construction of [LMS13], however, it isn't immediately 
clear how to do so in that setting, whereas in the quantum walk setting it is natural and immediate. 

5 Notes on other learning graphs 

In [Bell2a], a learning graph algorithm for graph collision is presented. We note that this is, in 
fact, based on an unpublished quantum walk algorithm due to Ambainis [Bell2a], which has the 
same complexity, up to log factors. 
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As to the possibility of reproducing the recent learning graph upper bound for k- 
distinctness [Bell2a], while this upper bound is achieved via a method that deviates significantly 
from the original learning graph framework, we do believe that we should be able to construct a 
quantum walk algorithm with matching complexity. 
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A Graph collision subroutine 



In this section, we show how to compute graph collision on a bipartite graph with classes of size 
r\ and r 2 in quantum query complexity O ((ri?^) 1 / 3 ) . The algorithm is a straightforward gener- 
alization of the standard graph collision algorithm of [MSS07], based on the element distinctness 
algorithm of [Amb04]. Note that when ri = r 2 , the well-known O yi^^j upper bound is achieved. 

Lemma A.l. The quantum query complexity of graph collision on a bipartite graph with classes 
of size r\ and r2, when r\ < r 2 < r\ , is in O ((n^) 1 / 3 ) • 

Proof. Let G be a bipartite graph on classes -Ri and R2 with \R\\ = r\ and |i? 2 | = r 2 . We proceed 
as in the standard graph collision algorithm: let m < r\ be a parameter to be optimized later. We 
walk on the product graph J(r\,m) x J(r 2 , m), where a state S±, S2 corresponds to a pair of subsets; 
Si is an m-subset of R\ and S" 2 is an m-subset of i? 2 . Two states Si, 5 2 and S[, S' 2 are adjacent in 
J(ri,m) x J(r 2 ,m) if S\ is adjacent to S[ in J(ri,m) and 5 2 is adjacent to S' 2 in J(r 2 ,m). 

The data associated with a state Si, 5 2 is the set of query values for vertices i G Si U 5 2 . Thus, 
the setup cost is S = 0(m), and the update cost is U = 0(1). A state is marked if there is a pair 
ii £ Si and i 2 G S2 such that ii is adjacent to i 2 and both are marked. Since the data contains 
query values for both i and i! , checking costs C = queries. The proportion of marked states is 
at least e = Q The spectral gap of J(ri,m) x J(r 2 ,m) is 5 = (^)- Thus, the query 

complexity is 

n ( , Vn^ ^\ 

(J m-\ v m i 

\ m J 

which is optimized by setting m = (r , ir 2 ) 1 / 3 (which is smaller than t\ when r 2 < rf), yielding 
quantum query complexity O ((rir 2 ) 1 / 3 ). □ 



19 



